Speaker and Gender Identification using Multilingual Speech

نویسندگان

  • Pankaj Kumar Mishra
  • Anupam Shukla
  • Kevin R. Farrell
  • Jayant M. Naik
چکیده

As the demand for multilingual speaker recognizers increases, the development of systems which combine automatic speaker and gender identification, models becomes increasingly important. In this work a speaker and gender identification system is developed using multilingual speech signal as input. MFCCs and delta-MFCCs, LPC, LPCC , Formants ,ZCR are used to build modal for classification and to reduce size of feature vector k-means clustering used. Radial basis function network and multi-layer perceptron are used for classification and their results are compared. Here resilient back propagation algorithm used to train MLP. Two separate modules are used for gender and speaker identification in each experiment. In this experiment accuracy of gender identification is 99% and speaker recognition is 91% using back propagation algorithm and 98% and 92% for gender and speaker identification using radial basis

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker and Gender Identification on Indian Languages using Multilingual Speech

In this paper an attempt is made to develop speaker & gender identification system using continuous speech signal spoken in different languages as input. MFCCs and delta-MFCCs are used to build modal for classification . Radial basis function network is used for classification. Here resilient back propagation algorithm used to train Multilingual Speech signal . Two separate modules are used for...

متن کامل

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

Phonetic Speaker Id

This paper describes the exploration of text-independent speaker identification using novel approaches based on speakers’ phonetic features instead of traditional acoustic features. Different phonetic speaker identification approaches are discussed in this paper and evaluated using two speaker identification systems: one multilingual system and one single language multiple-engine system. Furthe...

متن کامل

Improvements in Non-Verbal Cue Identification Using Multilingual Phone Strings

Today’s state-of-the-art front-ends for multilingual speechto-speech translation systems apply monolingual speech recognizers trained for a single language and/or accent. The monolingual speech engine is usually adaptable to an unknown speaker over time using unsupervised training methods; however, if the speaker was seen during training, their specialized acoustic model will be applied, since ...

متن کامل

GlobalPhone: A Multilingual Text & Speech Database in 20 Languages

This paper describes the advances in the multilingual text and speech database GlobalPhone, a multilingual database of highquality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages. GlobalPhone was designed to be uniform across languages with respect to the amount of data, speech quality, the collection scenario, the transcription and phone set convent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015